Global-scale distributed I/O with ParaMEDIC

نویسندگان

  • Pavan Balaji
  • Wu-chun Feng
  • Heshan Lin
  • Jeremy S. Archuleta
  • Satoshi Matsuoka
  • Andrew S. Warren
  • João Carlos Setubal
  • Ewing L. Lusk
  • Rajeev Thakur
  • Ian T. Foster
  • Daniel S. Katz
  • Shantenu Jha
  • K. Shinpaugh
  • Susan Coghlan
  • Daniel A. Reed
چکیده

Achieving high performance for distributed I/O on a wide-area network continues to be an elusive holy grail. Despite enhancements in network hardware as well as software stacks, achieving high-performance remains a challenge. In this paper, our worldwide team took a completely new and non-traditional approach to distributed I/O, called ParaMEDIC: Parallel Metadata Environment for Distributed I/O and Computing, by utilizing application-specific transformation of data to orders of magnitude smaller metadata before performing the actual I/O. Specifically, this paper details our experiences in deploying a large-scale system to facilitate the discovery of missing genes and constructing a genome similarity tree by encapsulating the mpiBLAST sequencesearch algorithm into ParaMEDIC. The overall project involved nine computational sites spread across the U.S. and generated more than a petabyte of data that was “teleported” to a large-scale facility in Tokyo for storage.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Distributed I/O with ParaMEDIC: Experiences with a Worldwide Supercomputer

Achieving high performance for distributed I/O on a wide-area network continues to be an elusive holy grail. Despite enhancements in network hardware as well as software stacks, achieving high-performance remains a challenge. In this paper, our worldwide team took a completely new and non-traditional approach to distributed I/O, called ParaMEDIC: Parallel Metadata Environment for Distributed I/...

متن کامل

ParaMEDIC: Parallel Metadata Environment for Distributed I/O and Computing

BLAST is a widely used software toolkit for genomic sequence search. mpiBLAST is a freely available, open-source parallelization of BLAST that uses database seg-mentation to allow different worker processors to search (in parallel) unique segments of the database. After searching , the workers write their output to a filesystem. While mpiBLAST has been shown to achieve high performance in clust...

متن کامل

Global rating scale for the assessment of paramedic clinical competence.

OBJECTIVE The aim of this study was to develop and critically appraise a global rating scale (GRS) for the assessment of individual paramedic clinical competence at the entry-to-practice level. METHODS The development phase of this study involved task analysis by experts, contributions from a focus group, and a modified Delphi process using a national expert panel to establish evidence of con...

متن کامل

Assessment of non-clinical attributes in paramedicine using multiple mini-interviews.

BACKGROUND Non-clinical attributes are increasingly emphasised as an important factor in paramedic practice. However, the assessment of these attributes often lacks the evidence base to support it. Exploring the relationship between non-clinical attributes and clinical skills is also of theoretical and practical importance. OBJECTIVE To first seek evidence of reliability and validity for the ...

متن کامل

Review article: Paramedic education opportunities and challenges in Australia.

Paramedic education has been undergoing major development in Australia in the past 20 years, with many different educational programmes being developed across all Australian jurisdictions. This paper aims to review the current paramedic education programmes in Australia to identify the similarities and differences between the programmes, and the strengths and challenges in these programmes. A l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Concurrency and Computation: Practice and Experience

دوره 22  شماره 

صفحات  -

تاریخ انتشار 2010